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Abstract. We study possible formulations of algebraic propositional proof systems operating 
with noncommutative formulas. We observe that a simple formulation gives rise to systems at 
least as strong as Frege — yielding a semantic way to define a Cook-Reckhow (i.e., polynomially 
verifiable) algebraic analog of Frege proofs, different from that given in [BIK + 97, GH03]. We 
then turn to an apparently weaker system, namely, polynomial calculus (PC) where polynomials 
are written as ordered formulas (PC over ordered formulas, for short): an ordered polynomial 
is a noncommutative polynomial in which the order of products in every monomial respects a 
fixed linear order on variables; an algebraic formula is ordered if the polynomial computed by 
each of its subformulas is ordered. We show that PC over ordered formulas is strictly stronger 
than resolution, polynomial calculus and polynomial calculus with resolution (PCR) and admits 
polynomial-size refutations for the pigeonhole principle and the Tseitin's formulas. We conclude 
by proposing an approach for establishing lower bounds on PC over ordered formulas proofs, 
and related systems, based on properties of lower bounds on noncommutative formulas. 

The motivation behind this work is developing techniques incorporating rank arguments 
(similar to those used in algebraic circuit complexity) for establishing lower bounds on propo- 
sitional proofs. 
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1. Introduction 

This work investigates algebraic proof systems establishing propositional tautologies, in which 
proof lines are written as noncommutative algebraic formulas (noncommutative formulas, for 
short). Research into the complexity of algebraic propositional proofs is a central line in proof 
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complexity (cf. [Pit97, Tza08] for general expositions). Another prominent line of research 
is that dedicated to connections between circuit classes and the propositional proofs based on 
these classes. In particular, considerable efforts were made to borrow techniques used for lower 
bounding certain circuit classes, and utilize them to show lower bounds on proofs operating 
with circuits from the given classes. For example, bounded depth Frege proofs can be viewed 
as propositional logic operating with AC circuits, and lower bounds on bounded depth Frege 
proofs use techniques borrowed from AC circuits lower bounds (cf. [Ajt88, KPW95, PBI93]). 
Pudlak et al. [Pud99, AGP02] studied proofs based on monotone circuits — motivated by known 
exponential lower bounds on monotone circuits. Raz and the author [RT08b, RT08a, Tza08] 
investigated algebraic proof systems operating with multilinear formulas — motivated by lower 
bounds on multilinear formulas for the determinant, permanent and other explicit polynomials 
[Raz09, Raz06]. Atserias et al. [AKV04], Krajicek [Kra08] and Segerlind [Seg07] have considered 
proofs operating with ordered binary decision diagrams (OBDDs). 

The current work is a contribution to this line of research, where the circuit class is noncom- 
mutative formulas. The motivation behind this work is the hope that certain rank arguments, 
found successful in lower bounding the size of certain algebraic circuits, might facilitate also in 
establishing lower bounds for the corresponding algebraic proofs. For this purpose, the choice of 
noncommutative formulas is natural, since such formulas constitute a fairly weak circuit class, 
and the proof of exponential-size lower bounds on noncommutative formulas, given by Nisan 
[Nis91], uses an especially transparent rank argument. 

We will show that for certain formulations of propositional proof systems over noncommu- 
tative formulas demonstrating lower bounds is likely to be hard, as the systems we get are 
considerably strong, and specifically, at least as strong as Frege proofs. On the other hand, 
by using a fairly restricted formulation of proofs operating with noncommutative formulas, we 
obtain a system that we show is strictly stronger than known algebraic proof systems (like the 
polynomial calculus). For this apparently weaker system, demonstrating lower bounds seems 
not to be outside the reach of current techniques. In particular, we propose to study the com- 
plexity of these proofs by measuring the maximal rank of a polynomial appearing in a proof, 
instead of the maximal degree (the latter is done in the polynomial calculus). It is known 
that the rank of a noncommutative polynomial (as defined for instance by Nisan [Nis91]) is 
proportional to the minimal size of a noncommutative formula computing the polynomial. We 
argue for the usefulness of measuring the maximal rank of a polynomial in algebraic proofs, by 
demonstrating a certain property of ranks of "ordered polynomials" (as defined formally), and 
relating it to proof complexity lower bounds (via an example of a conditional lower bound). 

1.1. Results and related works. We concentrate on algebraic proofs establishing proposi- 
tional contradictions where polynomials are written as noncommutative formulas. We deal with 
two kinds of proof systems — both are variants (and extensions) of the polynomial calculus (PC) 
introduced in [CEI96]. In PC we start from a set of initial polynomials from F[xi, . . . ,x n 1, the 
ring of polynomials with coefficients from F (the intended semantics of a proof-line p is the 
equation p = over F). We derive new proof-lines by using two basic algebraic inference rules: 
from two polynomials p and q, we can deduce a -p+(3 ■ q, where a, f3 are elements of F; and from 
p we can deduce Xi -p, for a variable Xi (i = 1, . . . , n). We also have Boolean axioms xf — xi = 0, 
for all i = 1, . . . , n, expressing that the variables get the values or 1. Our two proof systems 
extend PC as follows: 

(1) PC over noncommutative formulas: NFPC. This proof system operates with noncom- 
mutative polynomials over a field, written as (arbitrarily chosen) 1 noncommutative for- 
mulas. The rules of addition and multiplication are similar to PC, except that multi- 
plication is done either from left or right. We also add a a Boolean axiom xiXj — XjXi 
that expresses the fact that for 0, 1 values to the variables, multiplication is in fact 
commutative. 

(2) PC over ordered formulas: OFPC. This proof system is PC operating with ordered 
polynomials written as (arbitrarily chosen) ordered formulas. An ordered polynomial is 
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a noncommutative polynomial such that the order of products in all monomials respects 
a fixed linear order on the variables, and an ordered formula is a noncommutative 
formula in which every subformula computes an ordered polynomial. 

Both proof systems are shown to be Cook-Reckhow systems (that is, polynomial verifiable, 
sound and complete proof systems for propositional tautologies). 

(1) The first proof system NFPC is shown to polynomially simulate Frege (this is partly 
because of the choice of Boolean axioms). This gives a semantic definition of a Cook-Reckhow 
proof system operating with algebraic formulas, simpler in some way from that proposed by 
Grigoriev and Hirsch [GH03]: the paper [GH03] aims at formulating a formal propositional 
proof system for establishing propositional tautologies (that is, a Cook-Reckhow proof sys- 
tem), which is an algebraic analog of the Frege proof system. In order to make their system 
polynomially- verifiable, the authors augment it with a set of auxiliary rewriting rules, intended 
to derive algebraic formulas from previous algebraic formulas via the polynomial-ring axioms 
(that is, associativity, commutativity, distributivity and the zero and unit elements rules). In 
this framework algebraic formulas are treated as syntactic terms, and one must explicitly apply 
the polynomial-ring rewrite rules to derive one formula from another. Our proof system NFPC 
is simpler in the sense that we get a similar proof system to that in [GH03], while adding no 
rewriting rules (both our proof system and that in [GH03] can simulate Frege and both are poly- 
nomially verifiable and operate with algebraic formulas, or in our case with noncommutative 
formulas). The idea is that because we use noncommutative formulas as proof-lines, to verify 
that a lines was derived correctly from previous lines we can use the deterministic polynomial 
identity testing algorithm for noncommutative formulas devised by Raz and Shpilka [RS05] (and 
so we do not need any rewriting rules). 

(2) For the second proof system, OFPC, we show that, despite its apparent weakness, it is 
stronger than Polynomial Calculus with Resolution (PCR; and hence it is also stronger than 
both PC and resolution), and also can polynomially simulate a proof system operating with 
restricted forms of disjunctions of linear equalities called R°(lin) (introduced in [RT08a]). The 
latter implies polynomial-size refutations for the pigeonhole principle and the Tseitin graph 
formulas, due to corresponding upper bounds demonstrated in [RT08a]. 

We then propose a simple lower bound approach for OFPC, based on properties of products 
of ordered formulas (these properties are proved in a similar manner to Nisan's lower bound 
on noncommutative formulas, by lower bounding the rank of matrices associated with noncom- 
mutative polynomials). We show certain sufficient conditions yielding super-polynomial lower 
bounds on OFPC proofs. 

Note: All the results in this paper hold when one considers algebraic branching programs 
(ABPs) instead of noncommutative formulas, and ordered- ABPs instead of ordered-formulas. 
For the precise definition of ABP see e.g., [Nis91]. An ordered- ABP is an ABP such that the 
order of variables appearing on the edges of every path from source to sink on the ABP graph, 
respects a fixed linear order on the variables (see [JQS10] for a close model called 7r-ordered 
ABP). 

Related work. There is some resemblance between noncommutative formulas (and in fact, 
algebraic branching programs) and ordered binary decision diagrams (OBDDs) (e.g., close tech- 
niques were used to obtain polynomial identity testing algorithms for noncommutative formulas 
[RS05] and for OBDDs [Waa97]). Thus, proofs operating with noncommutative formulas are 
reminiscent to the OBDD-based proof systems introduced in [AKV04, Kra08, Seg07]. Never- 
theless, one difference between OBDD-based proofs and noncommutative formulas-based proofs 
is that the feasible monotone interpolation lower bound technique is applicable in the case of 
OBDD-based systems, while this technique does not known to lead to super-polynomial size 



This means that if a proof-line consists of the polynomial p, then one may choose to write any formula that 
computes p. (These kind of systems are sometimes called "semantic" proof systems.) 

3 



lower bounds even on PC proofs (and thus, also on OFPC proofs which are shown to polyno- 
mially simulate PC proofs). 

Another proof system, that is even closer to OFPC, is that operating with multilinear for- 
mulas introduced in [RT08b] (under the name fMC). The upper bounds on OFPC proofs are 
similar to that shown for multilinear proofs in [RT08b]. Moreover, the technique used by Raz 
to establish super-polynomial lower bounds on multilinear formulas in [Raz09] is close — though 
more involved — to that used by Nisan in the lower bound proof for noncommutative formulas 
[Nis91]. Therefore, proving lower bounds on OFPC proofs might help in establishing lower 
bounds on multilinear proofs as well. 

2. Preliminaries 
For a natural number we let [n] = {1, . . . , n}. 

2.1. Noncommutative polynomials and formulas. Let F be a field. Denote by 
¥[xi, . . . ,x n ] the ring of (commutative) polynomials with coefficients from F and variables 
x%, . . . ,x n . We denote by ¥(x\, . . . ,x n ) the noncommutative ring of polynomials with coef- 
ficients from F and variables x\, . . . , x n . In other words, ¥(x\, . . . , x n ) is the ring of polynomials 
(where a polynomial is a formal sum of products of variables and field elements) conforming 
to all the polynomial-ring axioms excluding the commutativity of multiplication axiom. For 
instance, if two different variables, then x% ■ Xj and two different polynomials 

in F(xi, . . . ,x n ) (note that variables do commute with field elements). 

We say that A is an algebra over F, or an ¥-algebra, if A is a vector space over F together 
with a distributive multiplication operation; where multiplication in A is associative (but it 
need not be commutative) and there exists a multiplicative unity in A. 

A noncommutative formula is just a (commutative) arithmetic formula, except that we take 
care for the order in which products are done: 

Definition 2.1 (Noncommutative formula). Let ¥ be a field and xi,X2,--- be variables. A 
noncommutative algebraic formula is a labeled tree, with edges directed from the leaves to the 
root, and with fan-in at most two, such that there is an order on the edges coming into a node 
(the first edge is called the left edge and the second one the right edge). Every leaf of the tree 
(namely, a node of fan-in zero) is labeled either with an input variable xi or a field ¥ element. 
Every other node of the tree is labeled either with + or x (in the first case the node is a plus gate 
and in the second case a product gate). We assume that there is only one node of out-degree 
zero, called the root. An algebraic formula computes a noncommutative polynomial in the ring 
of noncommutative polynomials ¥(x\, . . . , x n ) in the following way. A leaf computes the input 
variable or field element that labels it. A plus gate computes the sum of polynomials computed 
by its incoming nodes. A product gate computes the noncommutative product of the polynomials 
computed by its incoming nodes according to the order of the edges. (Subtraction is obtained 
using the constant —1.) The output of the formula is the polynomial computed by the root. The 
depth of a formula is the maximal length of a path from the root to the leaf. 

The size of an algebraic formula (and noncommutative formula) / is the total number of 
nodes in its underlying tree, and is denoted |/|. 

Raz and Shpilka [RS05] showed that there is a deterministic polynomial identity testing (PIT) 
algorithm that decides whether two noncommutative formulas compute the same noncommu- 
tative polynomial: 

Theorem 2.1 (PIT for noncommutative formulas [RS05]). There is a deterministic polynomial- 
time algorithm that decides whether a given noncommutative formula over a field ¥ computes 
the zero polynomial 0. 2 



'We assume here that the field F can be efficiently represented (e.g., the field of the rationals). 
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2.2. Polynomial Calculus. Algebraic prepositional proof systems are proof systems for finite 
collections of polynomial equations having no 0, 1 solutions over some fixed field. (Formally, 
each different field yields a different algebraic proof system.) Proof- lines in algebraic proofs 
(or refutations) consist of polynomials p over the given fixed field. Each such proof-line is 
interpreted as the polynomial equation p = 0. To consider the size of algebraic refutations we 
fix the way polynomials inside refutations are written. 

Notation: An inference rule is written as jj or A C B , meaning that given the proof- line A one 
can deduce the proof-line B, or given both the proof-lines A, B one can deduce the proof-line 
C, respectively. 

The Polynomial Calculus is a prepositional algebraic proof system first considered in [CEI96]: 

Definition 2.2. (Polynomial Calculus (PC)). Let ¥ be some fixed field and let Q = 
{Qi, ■ ■ ■ ,Q m } be a collection of multivariate polynomials from ¥[x\, . . . ,x n ]. Let the set of 
axiom polynomials be: 

Boolean axioms: Xi ■ (1 — Xi) , for all 1 < i < n . 
A PC proof from Q of a polynomial g is a finite sequence tt = (pi,-.-,pe) of multivariate 
polynomials from ¥[x\, . . . , x n ], where pe = g and for every 1 < i < £, either pi = Qj for some 
j € [m], or pi is a Boolean axiom, or pi was deduced from pj , pk , for j,k < i, by one of the 
following inference rules: 



Product: 
Addition: 



P 



x r ■ p 
P 



for 1 < r < n . 



for a,b G F . 



a ■ p + b • q 

A PC refutation of Q is a proof of 1 (which is interpreted as 1 = 0, that is the unsatisfiable 
equation standing for false,) from Q. The degree of a PC-proof is the maximal degree of a 
polynomial in the proof. The size of a PC proof is the total number of monomials (with nonzero 
coefficients) in all the proof-lines. 

Important note: The size of PC proofs can be defined as the total formula sizes of all proof- 
lines, where polynomials are written as sums of monomials, or more formally, as (unbounded 
fan-in depth-2) XII formulas. 3 This complexity measure is equivalent — up to a factor of n — to 
the usual complexity measure counting the total number of monomials appearing in the proofs 
(Definition 2.2). 

Definition 2.3. (Polynomial Calculus with Resolution (PCR)). The PCR proof system 
is defined similarly to PC (Definition 2.2), except that for every variable Xi a new formal variable 
xi and a new axiom Xi + X{ — 1 are added to the system, and the Boolean axioms of PCR are 
as follows: 

Boolean axioms: Xi ■ Xi . 
The inference rules, and all other definitions are similar to that of PC. Specifically, the size of 
a PCR proof is defined as the total number of monomials in all proof-lines (where now we count 
monomials in the variables Xi and Xi). 

2.3. Proof systems and simulations. Let L C X* be a language over some alphabet X. A 
proof system for a language L is a polynomial-time algorithm A that receives x € X* and a 
string 7r over a binary alphabet ("the [proposed] proof of x), such that there exists a n with 
A{x, tt) = true if and only if x G L. Following [CR79], a Cook-Reckhow proof system (or a 
propositional proof system) is a proof system for the language of propositional tautologies in 



'A EII formula F is an algebraic formula whose underlying tree is of depth 2 and has unbounded fan-in, such 
that the root is labeled with a plus gate, the children of the root are labeled with product gates and the leaves 
are labeled with either variables or field elements. 
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the De Morgan basis {true, false, V, A, -1} (coded in some efficient [polynomial-time] way, e.g., in 
the binary {0, 1} alphabet). 

Assume that V is a proof system for the language L, where L is not the set of propositional 
tautologies in De Morgan's basis. In this case we can still consider V as a proof system for 
propositional tautologies by fixing a translation between L and the set of propositional tautolo- 
gies in De Morgan basis (such that x S L iff the translation of x is a propositional tautology 
[and such that the translation can be done in polynomial-time]). If two proof systems V\ and 
V2 establish two different languages L\,Li2, respectively, then for the task of comparing their 
relative strength we fix a translation from one language to the other. In most cases, we shall 
confine ourselves to proofs establishing propositional tautologies or unsatisfiable CNF formulas. 

A propositional proof system is said to be a propositional refutation system if it establishes 
the language of unsatisfiable propositional formulas (this is clearly a propositional proof system 
by the definition above, since we can translate every unsatisfiable propositional formula into its 
negation and obtain a tautology). 

Definition 2.4. Let Vi,T > 2 be two proof systems for the same language L (in case the proof 
systems are for two different languages we fix a translation from one language to the other, as 
described above). We say that V2 polynomially simulates V\ if given a V\ proof (or refutation) 
7r of a F, then there exists a proof (respectively, refutation) of F in V2 of size polynomial in the 
size of it. In case V2 polynomially simulates V\ while V\ does not polynomially simulates V2 
we say that V2 is strictly stronger than V\ . 

3. Polynomial calculus over noncommutative formulas 

3.1. Discussion. In this section we propose a possible formulation of algebraic propositional 
proof systems that operate with noncommutative polynomials. We observe that dealing with 
propositional proofs — that is, proofs whose variables range over 0, 1 values — makes the variables 
"semantically" commutative. Therefore, for the proof systems to be complete (for unsatisfiable 
collections of noncommutative polynomials over 0, 1 values), one may need to introduce rules or 
axioms expressing commutativity. We show that such a natural formulation of proofs operating 
with noncommutative formulas polynomially simulate the entire Frege system. 

This justifies — if one is interested in concentrating on propositional proof systems weaker than 
Frege (and especially on lower bounds questions) — our formulation in Section 4 of algebraic 
proofs operating with noncommutative algebraic formulas with a fixed product order (called 
ordered formulas). The latter system can be viewed as operating with commutative polynomials 
over a field precisely like PC, while the complexity of proofs is measured by the total sizes of 
ordered formulas needed to write the polynomials in the proof. In other words, the role played 
by the noncommutativity in this system is only in measuring the sizes of proofs: while in PC- 
proofs the size measure is defined as the number of monomials appearing in the proofs — or 
equivalently, the total size of formulas in proofs in which formulas are written as (depth-2) £11 
circuits — the proof system developed in Section 4 is measured by the total ordered formula size. 

3.2. The proof system NFPC. We now define a proof system operating with noncommuta- 
tive polynomials written as noncommutative algebraic formulas. 

In algebraic proof systems like the polynomial calculus we transform unsatisfiable proposi- 
tional formulas into a collection Q of polynomials having no solution over a field F. In the 
noncommutative setting we translate unsatisfiable propositional formulas into a collection Q of 
noncommutative polynomials from ¥(x\, . . . ,x n ) that have no solution over any noncommuta- 
tive F-algebra (e.g., the matrix algebra with entries from F). Although our "Boolean" axioms 
will not force only 0, 1 solutions over noncommutative F-algebras, they will be sufficient for 
our purpose: every unsatisfiable propositional formula translates (via a standard polynomial 
translation) into a collection Q of noncommutative polynomials from ¥(x\, . . . ,x n ), for which 
Q and the Boolean axioms have no (common) solution in any noncommutative F-algebra. Fur- 
thermore, the Boolean axioms will in fact force commutativity of variables product — as required 
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for variables that range over 0, 1 values (although, again, the Boolean axioms do not force only 
0, 1 values when variables range over noncommutative F-algebras). 

Definition 3.1 (Polynomial calculus over noncommutative formulas: NFPC). Fix afieldF and 
let Q := . . . , Qm} be a collection of noncommutative polynomials from W(xi, . . . ,x n ). Let 
the set of axiom polynomials be: 
Boolean axioms: 

Xi ■ (1 — Xi) , for all 1 < i < n . 
Xi ■ Xj — Xj ■ Xi , for all 1 < i ^ j < n . 

Let tt = (pi, . . . ,pi) be a sequence of noncommutative polynomials from ¥(x\, . . . ,x n ), such 
that for each i € [£], either pi = qj for some j € [m], or pi is a Boolean axiom, or pi was 
deduced by one of the following inference rules using pj , p^ , for j,k < i : 
Left/right product: 



p p 



Addition: 

p q 



for 1 < r < n 



for a,b G F . 



a ■ p + b • q 

We say that tt is an NFPC proof of pe from Q if all proof-lines in tt are written as noncommu- 
tative formulas. (The semantics of an NFPC proof-line pi is the polynomial equation pi = 0.) 
An NFPC refutation of Q is a proof of the polynomial 1 from Q. The size of an NFPC proof 
7r is defined as the total sizes of all the noncommutative formulas in tt and is denoted \tt\. 

Remark: (i) The Boolean axioms might have roots different from 0, 1 over noncommutative 
F-algebras. (ii) The Boolean axioms are true for 0, 1 assignments: = for all 

Xi,xj € {0, 1}. 

We now show that NFPC is a sound and complete Cook-Reckhow proof system. First note 
that we have defined NFPC with no rules expressing the polynomial-ring axioms (the latter are 
sometimes added to algebraic proof systems operating with algebraic formulas for the purpose 
of verifying that every formula in the proof was derived correctly [via the deduction rules of the 
system] from previous lines; see discussion in Section 1.1). Nevertheless, due to the deterministic 
polynomial-time PIT procedure for noncommutative formulas (Theorem 2.1) the proof system 
defined will be a Cook-Reckhow system (that is, verifiable in polynomial-time [whenever the 
base field and its operations can be efficiently represented]). 

Proposition 3.1. There is a deterministic polynomial-time algorithm that decides whether a 
given string is an NFPC-proo/ (over efficiently represented fields). 

Proof. We can assume that the proof also indicates from which previous lines a new line was 
inferred via the NFPC inference rules. Then, by Proposition 2.1, there is a polynomial-time 
algorithm that, e.g., given two noncommutative formulas Fi,F2 such that the proof indicates 
that i<2 was inferred from F\ via the Left product rule, decides whether the formula x^ x F\ 
and i<2 computes the same noncommutative polynomial. And similarly for the other deduction 
rules of NFPC. □ 

Proposition 3.2. The systems NFPC is sound and complete. Specifically, let Q be a collection 
of noncommutative polynomials from F(xi, . . . ,x n ). Assume that for every F-algebra, there is 
no 0, 1 solution for Q (that is, an 0, 1 assignment to variables that gives all polynomials in Q 
the value 0), then the contradiction 1 = can be derived in NFPC from Q. 

Proof. Soundness holds because both rules of inference are sound over any F-algebra. Com- 
pleteness stems by the simulation of T-VC shown in Theorem 3.3 below (and the fact that if no 
F-algebra has a solution then also there is no solution in F itself, which implies, by completeness 
of J--VC, that there exists an T-VC refutation of Q). □ 
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For the next statements we use the algebraic propositional proof system T-VC introduced 
by Grigoriev and Hirsch [GH03] as an algebraic analog of the Frege system. The proof system 
T-VC is an algebraic propositional proof system operating with (general, that is, commutative) 
algebraic formulas over a field, and it includes auxiliary rewriting rules allowing to develop 
equal polynomials syntactically via the polynomial-ring axioms. The proof system T-VC has 
the Boolean axioms of PC, the rules of PC and in addition the rewrite rules expressing the 
polynomial-ring axioms. Each line in T-VC is treated as a term, that is, a formula, and so the 
rules are also syntactic: addition of terms via the plus gate and product of a term by a variable 
from the left. We first need to define the notion of a rewrite rule: 

Definition 3.2 (Rewrite rule). A rewrite rule is a pair of formulas f,g denoted f —> g. Given 
a formula <5 ; an application of a rewrite rule / — > g to $ is the result of replacing at most one 
occurrence of f in & by g (that is, substituting a subformula f inside <I? by the formula g). We 
write f <r-> g to denote the pair of rewriting rules f ' — > g and g — > / ' . 

Definition 3.3 {T-VC [GH03]). Fix a fieldF. Let F := {fx, . . . , f m } be a collection of formulas 4 
computing polynomials from F[xi, . . . ,x n ). Let the set of axioms be the following formulas: 
Boolean axioms: xi ■ (1 — Xi) , for all 1 < i < n . 

A sequence tt = (<&i, . . . , 3>f) of formulas computing polynomials from ¥[xx, ■ ■ ■ ,x n ] is said to 
be an T-VC proof of §t from F, if for every i G [£] we have one of the following: 

(1) $j = fj , for some j G [to]; 

(2) <J?j is a Boolean axiom; 

(3) &i was deduced by one of the following inference rules from previous proof-lines , 
forj,k < i: 



Product: 



x r ■ <£ 

Addition: 



for r £ [n] . 



for a, b G F . 



a-^ + b-0 ' 

(Where <3?, x r a • <&,b • Q are formulas constructed as displayed; e.g., x r ■ $ is the 

formula with product gate at the root having the formulas x r and $ as children.) 5 
(4) 3>i was deduced from previous proof-line for j < i, by one of the following rewrit- 
ing rules expressing the polynomial-ring axioms (where f,g,h range over all algebraic 
formulas computing polynomials in ¥[x±, . . . ,x n ]): 

Zero rule: • / <H> 

Unit rule: 1 • / <H> / 

Scalar rule: f f> a, where t is a formula containing no variables (only field ¥ ele- 
ments) that computes the constant a G F. 
Commutativity rules: / + g «-> g + / , / • g o g ■ f 
Associativity rule: / + (g + h) 4+ (/ + g) + h , f ■ (g ■ h) 4+ (/ • g) ■ h 
Distributivity rule: / ■ (g + h) -H- (/ • g) + (/ ■ h) 

(The semantics of an T-VC proof-line pi is the polynomial equation pi = 0.) An T-VC refutation 
of F is a proof of the formula 1 from F . The size of an T-VC proof ir is defined as the total 
sizes of all formulas in tt and is denoted by \ir\. 

Theorem 3.3. NFPC (over any field) poly nomially- simulates Frege. Specifically, NFPC 
polynomially- simulates T-VC in the following sense: let fi, ■ ■ ■ , f m be a set of commutative 
formulas computing (commutative) polynomials that have no common 0,1 root, and assume 



Note here that we are talking about formulas (treated as syntactic terms), and not polynomials. Also notice 
that all formulas in T-VC are (commutative) formulas computing (commutative) polynomials. 

^In [GH03] the product rule of T-VC is defined so that one can derive Q ■ $ from $, where O is any formula, 
and not just a variable. However, the definition of T-VC in [GH03] and our Definition 3.3 polynomially-simulate 
each other. 
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that there is a size s T-VC refutation of fi, . . . , f m . Then, there exists an NFPC refutation of 
the same set of formulas fi, ■ ■ ■ , f m (but now viewed as computing noncommutative polynomials) 
of size polynomial in s. 

Proof. By [GH03] (see Theorem 3 there), T-VC polynomially simulates Frege. We proceed by 
showing a simulation of T-VC by NFPC by induction on the number of steps in an T-VC proof. 

Base case: Axioms and initial formulas. All axioms of T-VC are also axioms in NFPC. Also, if 
the T-VC refutation uses an initial formula fi, then we use the same formula in NFPC. 

Induction step: 

Case 1: Addition rule. Assume we derive in T-VC the formula p + q. By induction hypothesis 
we already have the two formulas p, q in NFPC. Thus, we can add them via the addition rule. 
Case 2: Product rule. Assume we derive the formula Xi ■ p from the formula p in T-VC. By 
induction hypothesis we already have the formula p in NFPC. Thus, we can derive Xi ■ p by the 
Left product rule. 

Case 3: Rewriting rules. Assume we derived a formula / using one of the rewriting rules 
of T-VC. The rewriting rules of associativity, distributivity, scalar rule, and unit and zero 
rules of T-VC do not change the noncommutative polynomial computed by an algebraic for- 
mula. Therefore, we get them "for free" in NFPC, in the sense that we can choose to write 
a noncommutative polynomial p in the proof as any noncommutative formula, as long as the 
chosen formula computes the noncommutative polynomial p. Thus, we only need to show how 
to simulate the commutativity rule, namely to show how to simulate commuting a term inside 
a formula. The key lemma for this is the following: 

Lemma 3.4. Let ¥ be any field and let f,g be two noncommutative formulas computing (non- 
constant) polynomials from ¥{x\, . . . , x n ). Then, there is an NFPC proof of size polynomial in 
l/l + \g\ of the formula f-g-g-f. 

Proof. First, we need to show that NFPC allows for substitution of identities inside proofdines. 
Let A, h be noncommutative formulas and assume that the variable z occurs inside A only once. 
Then A[h/z] denotes the noncommutative formula obtained from A by replacing the leaf labeled 
z by the formula h. 

Claim 3.5. Let A be a noncommutative formula, and let z be a variable that occurs only once 
inside A. Let h,h' be two noncommutative formulas h,hf of maximal size s. Then, there is an 
NFPC proof of A[h/z] — A[h'/z] from h — h! of size polynomial in \A\ + s. 

Proof of claim: Straightforward induction on the size of A. ■ claim 

We get back to the proof of Lemma 3.4: proceed by induction on |/| + \g\ > 2. 
Base case: \f\ + \g\ = 2. By assumption the polynomials computed by /, g are both non- 
constant, and so f = Xi and g = Xj, for some i,j € [n]. Therefore, we are done by the Boolean 
axiom XiXj XjXi . 

Induction step: Either j/j > 1 or \g\ > 1. Assume without loss of generality that |/| > 1. 
Following Claim 3.5, we shall use freely substitutions in formulas. 

Case (i): f = fi + f 2 - Start from 

f ■ 9 - f ■ 9 = f ■ 9 - {h + f^) ■ g = f • g - h - 9 - h - 9 • (i) 

By induction hypothesis we have a proof of f\ ■ g — g ■ fi and of fz-g — g-fi- Thus, we can 
substitute these identities in (1), to get / • g - g ■ fi - g ■ f 2 = f ■ g - g ■ (/i + f 2 ) = f ■ g - g • /• 
Case (ii): f = fx ■ f 2 . Start from 



/ • 9 - f ■ 9 = f ■ 9 - (/i • f2) ■ g = f ■ g - fi ■ (h • g) 
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(2) 



By induction hypothesis we have a proof of f 2 ■ g — g ■ f 2 - Thus, we can substitute this identity 
in (2), to get / • g - /i • (g ■ / 2 ) = / • g - (/i • g) ■ f 2 . By induction hypothesis again, we have 
fi-g-g- fi- And similarly, we get by substitution / • g - (g ■ fi) ■ f 2 = f ■ g - g ■ f '. 

This concludes the proof of Lemma 3.4 □ 

To conclude the simulation of the commutativity rewrite rule of J--VC (which will also con- 
clude the proof of Theorem 3.3) we notice that, by Claim 3.5 and by Lemma 3.4, for any 
noncommutative formula A, such that z is a variable that occurs only once inside A, there is 
an NFPC proof of A[(f ■ g) / z] - A[(g ■ f) / z] of size polynomial in \A[(f -g)/z]\. □ 

4. Polynomial calculus over ordered formulas 

In this section we formulate an algebraic proof system OFPC that operates with noncom- 
mutative polynomials in which every monomial is a product of variables in nondecreasing order 
(from left to right; and according to some fixed linear order on the variables), and where poly- 
nomials in proofs are written as ordered formulas, as defined below. 

Let X = {x\, . . . ,x n } be a set of variables and let F be a field. Let ^ be a linear order 
on the variables X. Let / = ^2j,=jbjA4j be a commutative polynomial from ¥[xi, . . . , x n ], 
where the bj's are coefficient from F and the A4j's are monomials in the X variables. We define 
If} € F(xi, . . . , x n ) to be the (unique) noncommutative polynomial bj ■ \MjJ, where \M.j\ 

is the (noncommutative) product of all the variables in A4j such that the order of multiplications 
respects X. We denote the image of the map [•] : ¥[xi, . . . , x n ] — > ¥(x\, . . . , x n ) by Q. We say 
that a polynomial is an ordered polynomial if it is a polynomial from Q. 

Definition 4.1 (Ordered formula). Let < be some fixed linear order on variables x\, . . . ,x n . 
A noncommutative formula (Definition 2.1) is said to be an ordered formula if the polynomial 
computed by each of its subformulas is ordered. We say that an ordered formula F computes 
the commutative polynomial f £ ¥[x\, . . . ,x n ] whenever F computes [/J. 

An equivalent characterization of ordered formulas is as syntactic ordered formulas: 

Definition 4.2 (Syntactic ordered formula). An ordered formula is a syntactic ordered formula 
if for each of its product gates the left subformula contains only variables that are less-than or 
equal, via -<, than the variables in the right subformula of the gate. 

Proposition 4.1. There is a polytime algorithm that receives an algebraic formula and a 
linear order on its variables, and returns false if $ is not an ordered formula, and otherwise 
returns a syntactic ordered formula of the same size as $ that computes the same (ordered) 
polynomial. 

Proof. The algorithm is as follows: Search for a product node in F that has on its left subformula 
a variable that is greater (via the order than some variable in its right subformula. If there 
is no such product node, then F itself is a syntactic ordered formula, and the algorithm returns 
F. 

Otherwise, let v be a product gate in F, with F\ and F 2 its left and right subformulas, 
respectively. And suppose that F\ contains the variable Xi and F 2 contains the variable xj, 
such that Xi >- Xj. Let hi,h 2 be the polynomials computed by F\ and F 2 , respectively. 

We first check whether x^ occurs in h\ . To this end we substitute every appearance of Xi in 
F\ by the constant 0, and check if the resulted formula, denoted iq(0/xj), computes the same 
noncommutative polynomial as F\ (using the PIT algorithm for noncommutative formulas). 
If the answer to the latter question is "yes" , then we conclude that Xi does not occur in the 
polynomial h\, and we run the algorithm with the input formula F in which F\ is substituted 
by Fi(0/xi). If the answer to the question was "no", we check in a similar manner whether Xj 
occurs in h 2 . If Xj does not occur in h 2 we run the algorithm with the formula F in which F 2 
is substituted by F 2 (0/xj) (where F 2 (0/xj) is F 2 after substituting every appearance of Xj by 
0). If Xj does occur in the polynomial h 2 , then the polynomial computed at v is not ordered 
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(since we already know that x% occurs in hi, and so h\ ■ /12 is not an ordered polynomial), and 
so F is not an ordered formula, and we return false. 



Note that the algorithm described above returns either false (in case F is not an ordered 
formula) or a new formula that computes the same (noncommutative) polynomial as F and with 
the same size as F (because the only changes applied to the original formula F is substitution 
of variables by the constant 0). The running time of the algorithm is polynomial in the size of 
F. □ 

We can now define OFPC in a convenient way, that is, without referring to noncommutative 
polynomials: the system OFPC is defined similarly to PC, except that the proof-lines are written 
as ordered formulas: 

Definition 4.3 (PC over ordered formulas (OFPC)). Let tt = (p±, . . . ,p m ) be a PC proof ofp m 
from some set of initial polynomials Q (that is, pi are commutative polynomials from the ring 
of polynomials F[sci, . . . , x n ]), and let ^ be sojTic linear order on the vcividblGs x\ , . . . , x n . The 
sequence (fi,-.-,f m ) in which /, is an ordered formula computing pi (according to the order 
-<), is called an OFPC proof of p m from Q. The size of an OFPC proof is the total sizes of all 
the ordered formulas appearing in it. 

Similar to the proof system NFPC we have defined OFPC with no rules expressing the 
polynomial-ring axioms. Also, similar to NFPC, the system OFPC will constitute a Cook- 
Reckhow proof system, that is, there is a deterministic polynomial-time algorithm that decides 
whether a given string is an OFPC proof or not (whenever the base field and its operations can 
be efficiently represented): 

Proposition 4.2. For any linear order on the variables, OFPC is a sound, complete and 
polynomially-verifiable refutation system for establishing that a collection of (commutative) poly- 
nomial equations over a field does not have 0, 1 solutions. Specifically, ( considering the language 
of polynomial translations of Boolean contradictions) OFPC is a Cook-Reckhow proof system. 

Proof. The soundness and completeness of OFPC stem from the soundness and completeness 
of PC. The fact that OFPC is a Cook-Reckhow proof system is proved in Proposition 4.4 
below. □ 

We first need the following lemma: 

Lemma 4.3. For any linear order ^ on variables, there exists a polytime algorithm that receives 
an ordered formula $ computing [/] G ¥{xi, . . . , x n ) (for some polynomial f € ¥[xi, . . . , x n \) 
and a variable x r , for some 1 < r < n, and outputs a new ordered formula that computes \x r - /J. 

Proof. We can assume that $ is a syntactic ordered formula, as otherwise we can transform it 
into such a formula by using the algorithm in Proposition 4.1. By induction on the size of the 
formula we show that there is an algorithm A(&,x r ) that outputs the correct formula. 

Base case: 

(1) A(c, x r ) := c • x r , for c € F. 

(2) A{ Xi ■ x f'^J • — Xp * x 1 or depending on whether Xi ■< Xf or x 2 Xj> . 
respectively. 

Induction step: 

(1) + $2,2V) := A{<5> llXr ) + A($ 2 ,x r ). 

(2) A{&\ ■ $>2,x r ) := A($>i,x r ) ■ $2) in case Xi is less-than or equal {<) than every variable 
in $2, and otherwise A($i ■ <$>2,x r ) := $1 • A($ 2 ,x r ). 

□ 

Proposition 4.4. For any linear order ^ on variables, there exists a polytime algorithm that 
given a sequence tt of ordered formulas and another sequence Q\, . . . , Q m , g of ordered formulas, 
outputs 1 iff ir is an OFPC proof of the polynomial computed by g from the polynomials computed 
by Qi, . . .,Q m . 
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Proof. We verify the following: 

(1) All formulas in tt are ordered formulas (according to the fixed linear order). By Propo- 
sition 4.1, this can be done in polynomial-time in the size of tt. 

(2) The last formula in tt computes g. This can be done by checking that the last formula 
in tt computes the same noncommutative polynomial as g (using the PIT algorithm for 
noncommutative formulas in Theorem 2.1). 

(3) For every proof-line / € tt one of the following holds: 

(i) The formula / computes an axiom. This can be verified by checking whether / 
computes the same noncommutative polynomial as the formula x\ — x^, for some 
1 < i < n, or whether / computes some polynomial computed by Qi, for some 
1 < i < m (again, by the PIT algorithm for noncommutative formulas). 

(ii) The formula / computes the same ordered polynomial as F + G, for some pair 
F, G of ordered formulas in previous proof-lines (verify by the PIT algorithm for 
noncommutative formulas). 

(iii) The formula / computes \xi ■ hj, for some 1 < i < n, where h is a polynomial com- 
puted by some previous proof-line. To check this we do the following: considering 
a previous proof-line H, we run the algorithm in Lemma 4.3 where the inputs are 
H and X{. We get a new ordered formula H' , and we check if H' computes the 
same noncommutative polynomial as /. 

□ 

Notes: 

(1) In case we assume that there is an apriori fixed linear order of variables, we may speak 
about ordered formulas without referring explicitly to some linear order. 

(2) Formally, for different n's, every set of variables x±,...,x n may have linear orders that 
are incompatible with each other. Nevertheless, in this paper, given a family Q of 
collections of initial polynomials {Q n \ n 6 N} parameterized by n, and assuming that 
Qn C F[ x\, . . . ,x n ] for all 7i, we will consider only linear orders such that: for every 
n > 1, the linear order on x±, . . . , x n is an extension of the linear order on x±, . . . , x n -\. 
Equivalently, we can consider one fixed linear order on a countable set of variables 
X = {X!,X 2 , ■ ■ ■}■ 

5. Simulations, short proofs and separations for OFPC 

In this section we are concerned with the relative strength of OFPC. Specifically, we show 
that OFPC is strictly stronger than the polynomial calculus, polynomial calculus with resolution 
(PCR, for short; see Definition 2.3) and resolution (for a definition, see for example [ABSRW02]). 
For this purpose, we show first that, for any linear order on the variables, OFPC polynomi- 
ally simulates PCR. Since PCR polynomially simulates both PC and resolution, we get that 
OFPC also polynomially simulates PC and resolution. Second, we show that OFPC admits 
polynomial-size refutations of tautologies (formally, families of unsatisfiable collections of poly- 
nomial equations) that are hard (that is, do not have polynomial-size proofs) in PCR. 

Let t denote the linear transformation that maps the variables Xi, for any i £ [n], to (1 — Xj), 
and denote p\r the polynomial p under the transformation r. 

Proposition 5.1. For any linear order on the variables, OFPC polynomially simulates PCR 
(and PC and resolution). Specifically, if there is a size s PCR proof (with the variables 
x\ , . . . , x n , xi , . . . , x n ) of p from the axioms pj 1 , . . . , pj k , then there is an OFPC proof of p \ r 
from Pj^T, ...,p jk \r of size 0(n ■ s). 

Proof. Given some linear order on the variables, we assume that all ordered formulas respect 
this linear order (and so we do not refer explicitly to this order). 

Let 7r = (pi, . . . ,pt) be a PCR proof of size s from the axioms p J15 . . . ,pj k (that is, pj's are 
[commutative] polynomials from ¥[x±, . . . , x n , x±, . . . , x n ], for some field F, such that the total 
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number of monomials occurring in all proof- lines in tt is s). We need to show that there is an 
OFPC proof tt' of pi from the axioms, such that tt' has size 0(n ■ s). 

Let r be the sequence obtained from tt by replacing every product rule application in tt, 
deriving Xi ■ p from p (for any i = 1, . . . , n), by the following proof sequence: 

1. p 

2. Xi-p 

3. (1 -x^ -p 

(the second polynomial is derived by the product rule from the first polynomial, and the third 
polynomial is derived by the addition rule from the first and second polynomials). 

Let Y\t be the sequence obtained from V by applying the substitution r on every proof-line 
in T. We claim that r \ r is a PC proof of pt \ r from the initial polynomials pj 1 f r, . . . ,pj k \ r: 
first, note that all product rule applications using Xi variables were eliminated in T\t, and thus 
all product rule applications in F\t are legitimate PC product rule applications. Second, note 
that for any pair of polynomials g, h we have g \ r + h \ r = {g + h) \ r. Third, note that the 
axioms of PCR transform under r to either (which we can ignore in the new proof sequence) 
or to the PC axiom Xj(l — Xj). 

By construction, every proof- line in T \ r is either Pi \ t or xj ■ (pi \ r), for some pi € tt and 
j G [n]. Therefore, by definition of OFPC, it suffices to show that every pi\r and Xj ■ (pi\r), for 
some pi € tt and j € [n], have ordered formulas of size at most 0{m ■ n), where m is the number 
of monomials in pi. For this purpose it is enough to show that for every monomial in p, t 
there exists an 0{n) ordered formula computing the polynomial jM\t. The latter is true since 
every such polynomial is a product of at most n terms, where each term is either Xi or 1 — Xj, 
for some i € [n]; such a product can be clearly written as an ordered formula of size 0(n). □ 

5.0.1. OFPC polynomially simulates R°(lin). We now show that OFPC can polynomially sim- 
ulate the proof system R°(lin) introduced in [RT08a]. This will be used in Section 5.0.2 to 
establish the OFPC upper bounds. In that paper a refutation system R(lin) was introduced. 
R(lin) is a refutation system extending resolution to work with disjunctions of linear equations 
instead of disjunction of literals. R°(lin) is defined to be a subsystem of R(lin) in which cer- 
tain restrictions put on the possible disjunctions of linear equations allowed in a proof. For 
the precise definition of R(lin) and R°(lin) we refer the reader to [RT08a]. However, it is not 
entirely necessary to know the definitions of R(lin) and R°(lin), since we will use a polynomial 
translation of R°(lin) defined below, and describe explicitly what is needed for the proofs ahead. 
First, we need the definitions that follow. A polynomial translation of a clause \fj G j(x^ j ) is 

a any product of the form njej( x j — fy)> where bj € {0, 1} for all j € J, and where is the 
literal xj if bj = 1 and -iXj if bj = 0. Accordingly, we define the polynomial translation of a 
CNF formula as the set consisting of the polynomial translations of the clauses in a CNF. 

Definition 5.1 (Polynomial translation of R Cj d(lin)-lines). A polynomial translation of an 
R Ci d(lin)-line is a product D = \\- &J Lj, where the Lj's are linear forms, and: 

(1) All variables in the linear forms have integer coefficients with absolute values at most c 
(the constant terms are unbounded). 

(2) D can be written as nf=i where each Di either consists of (an unbounded) product 
of linear forms that differ only in their constant terms, or is a translation of a clause 
(as defined above). 

The width of a polynomial-translation of an R Cj d(lin)-/ine D is defined to be the total degree of 
the polynomial D. 

In other words, any polynomial translation of an R Ci d(lin)-line has the following general form: 




(3) 
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where k < d and for all r € [n] and t € [k], a\- is an integer such that \a\- | < c, and 6j € {0, 1} 
(for all j € J) (and ii, J are unbounded sets of indices). Clearly, a disjunction of clauses 

is a clause in itself, and so we can assume that in any R Cj( i(lin)-line only a single polynomial 
translation of a clause occurs. 

We shall use the following propositions: 

Proposition 5.2 (Algebraic translation of R°(lin); Corollary 9.11 [RT08a] (restated)). Let 
K := {K n | n £ N} be a family of unsatisfiable CNF formulas 6 , and let {P n \ n € N} be a family 
of R°(lin)-proofs of K. Then, there are two constants c,d that do not depend on n and a 
family of PC proofs {P^ \ n € N} of the polynomial translations of the family of CNFs K , such 
that for every n the proof P' n has polynomial- size in the size of P n number of steps, and where 
every line in P' n is a (polynomial translation of an) R Ci d(lin)-line (Definition 5.1) whose width 
is polynomial in the size of P„ . 

Remark: It is immaterial to define the size measure for R°(lin) refutations (though this concept 
is mentioned in Theorem 5.2); we shall only use the fact that R°(lin) has short refutations for 
some hard contradictions. 

Note: Although corollary 9.11 in [RT08a] is stated for PCR instead of PC, the translation 
holds also for PC (see Remark before Corollary 9.11 in [RT08a]). 

Definition 5.2 (Multilinearization operator). Given a field ¥ and a polynomial q G 
¥[x±, . . . ,x n ], we denote by M[g] the unique multilinear polynomial equal to q modulo the ideal 
generated by all the polynomials xf — Xi, for all variables xi. 

For example, if q = x\x2 + ax\ (for some a G F) then M[g] = x\X2 + ax^ . 

Proposition 5.3 (Implicit in [RT08b, RT08a]). Let P be a PCR refutation from initial mul- 
tilinear polynomials. Then we can transform P into a new PCR refutation P' from the same 
initial multilinear polynomials such that P' contains only multilinear polynomials, with only a 
polynomial increase in the number of steps. Moreover, if the proof lines in P are all R C) ^(lin)- 
lines of maximal width w, then all the proof lines in P' are multilinearizations o/R c /^/(lin)-/ine,s 
of maximal width polynomial in w and where c', d' depend only on c,d. 

Proof sketch: Given a PCR proof P = (pi, . . . ,p m ) in the variables {x±, . . . , x n , x±, ... , x n }, 
consider the sequence S of multilinearized polynomials (M[pi] , . . . , M[p m ]). Then, by the proof 
of Theorem 5.1 in [RT08b] one can add polynomially in m many multilinear polynomials to S 
so that the new sequence S' consists of only multilinear polynomials and constitutes a PCR 
refutation of the initial polynomials. (Theorem 5.1 from [RT08b] talks about fMC refutations 
[Definition 2.6 in [RT08b]]. However, it is clear from the definition of fMC that the underlying 
sequence of polynomials in any fMC refutation constitutes a PCR refutation as well.) 

Assume in addition that all polynomials in P are polynomial translations of R Cj d(lin) -lines 
(Definition 5.1). Then, S = (M[pi] , . . . , M[p m ]) is a sequence of multilinearizations of R Ci d(lin)- 
lines. The only thing left to check is that the additional polynomials added to 5 to yield S' in 
the proof of Theorem 5.1 [RT08b] are all polynomial translations o/R c '^/(lin)-/mes, where c', d' 
depend only on c, d. This could be done by straightforward inspection of the proof of Theorem 
5.1 [RT08bj. ■ 

Now we are ready to prove the main simulation of this subsection: 

Theorem 5.4. For any linear order on the variables, OFPC polynomially simulates R°(lin) 
(over large enough fields). Moreover, we can assume that all formulas appearing in the 
OFPC proofs simulating R°(lin) are depth-3 ordered formulas. 

Formally, we have a straightforward translation of CNFs to the language of R°(lin) (see [RT08a]). 
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By Propositions 5.2 and 5.3 and by the definition of OFPC, in order to prove Theorem 5.4 
it suffices to prove the following lemma (implicit in [RT08a]): 



Lemma 5.5 (Implicit in Lemma 9.14 [RT08a]). Let p be a polynomial translation of an R c ^(lin)- 

line of width w over n variables. Then, M[p] can be computed by an ordered formula of size 
polynomial inw-n over fields of size bigger than w-n. Moreover, the ordered formula is a SITS 
formula . 

Proof. The proof uses the fact that R C] d(lin)-lines are close to a product of d symmetric poly- 
nomials, and the fact that symmetric polynomials can be computed by small ordered formulas 
(of depth-3) over large enough fields. Specifically: 

Claim 5.6 (Restatement of Claim 9.15 in [RT08a]). Let D be a polynomial translation of an 
R Ci ,i(lin)-line of width w. Then, D is a linear combination (over ¥) of (w + c) c d many terms, 
such that each term is of degree at most w and can be written as 



IF 



A h , (4) 



where K is a collection of indices such that \K\ < c • d, and r^ 's are non-negative integers < w, 
and the Zf- 's are homogenous linear forms such that each z^ has a single integral coefficient for 
all variables in it?, and q is a polynomial translation of a clause. 

By this claim, to complete the proof of Lemma 5.5 it is sufficient to show that the multilin- 
earization of any term as in (4): 



M 



z k 



(5) 



"II 

keK 

can be computed by an ordered SITS formula of size polynomial in cdn, over fields of size bigger 
than c • w. This is done by using polynomial interpolation, as shown (implicitly) in Claim 9.16 
in [RT08a]. More specifically, Claim 9.16 in [RT08a] demonstrated that (5) can be computed 
by a formula <1> such that: (i) $ consists of polynomially in d, c many summands; (ii) each of 
these summands is a depth-3 SITS formula, in which every product gate is a product of linear 
forms; (iii) and each of these linear forms consists of only a single variable. 

Note that any such formula & is also an ordered formula, since the products are of linear 
forms, each of a single variable, one can order the products in a way that respects the underlying 
variable order ^. □ 

5.0.2. Corollaries: short proofs and separations. For natural numbers m > n, denote by 
-iFPHP™ the following unsatisfiable collection of polynomials: 

Pigeons: Vi G [m], (1 - a^i) • • • (1 - x i>n ) 

Functional : Mi G [m] V7c < £ G [n], ■ x^i (6) 
Holes : Vi < j G [m] Vfc G [n], x^j- ■ Xj t k 



As a corollary of the polynomial simulation of R°(lin) by OFPC, and the upper bounds on 
R°(lin) proofs demonstrated in [RT08a], we get the following result: 

Corollary 5.7. For any linear order on the variables, and for any m > n there are polynomial- 
size (in n) OFPC refutations of the m to n pigeonhole principle FPHP™ (over large enough 
fields). 



This means that every path from the root to the leaf in the formula tree starts with a plus gate, and the 
number of alternation in the path between plus and product gates is at most two 
^That is, Zk = b ■ X^=i Xi j f° r some natural number b. 
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-iFPHP™ is a direct translation of the CNF formula for the m to n functional pigeonhole 
principle. Thus, by known lower bounds, OFPC is strictly stronger than resolution and is 
separated from bounded depth Frege. On the other hand, Razborov [Razb98] and subsequently 
Impagliazzo et al. [IPS99] gave exponential lower bounds on the size of PC-refutations of a 
different low degree version of the Functional Pigeonhole Principle. In this low degree version 
the Pigeons polynomials in (6) are replaced by 1 — (x^i + . . . + Xi >n ), for all i € [m]. It is not 
hard to show (via reasoning inside R°(lin)) that OFPC admits polynomial-size refutations also 
for this low-degree version of the functional pigeonhole principle. This shows that OFPC is 
strictly stronger than PC (under the size measures as defined for OFPC and PC). 

The Tseitin graph tautologies were proved to be hard tautologies for several propositional 
proof system. We refer the reader to [RT08a], Definition 6.5, for the precise definition of the 
(generalized, mod p) Tseitin tautologies. We have the following: 

Corollary 5.8. Let G be an r-regular graph with n vertices, where r is a constant, and fix 
some modulus p. Then, for any linear order on the variables there are polynomial- size (in n) 
OFPC refutations of the corresponding Tseitin mod p formulas -iTseiting jP (over large enough 
fields). 

This stems from the R°(lin) polynomial-size refutations of the Tseitin mod p formulas demon- 
strated in [RT08a]. From the known exponential lower bounds on PCR (and PC and resolution) 
refutation size of Tseitin mod p tautologies (when the underlying graphs are appropriately ex- 
panding; cf. [BGIP01, BSI99, ABSRW04]), and for the polynomial simulation of PCR by OFPC, 
we conclude that OFPC is strictly stronger than PCR. 

6. Useful lower bounds on product of ordered polynomials 

In this section we show that the ordered formula size of certain polynomials can increase 
exponentially when multiplying the polynomials together. We use this to suggest an approach 
for lower bounding the size of OFPC proofs in Section 6.1. We use a method of partial deriva- 
tives matrix introduced by Nisan to obtain exponential-size lower bounds on noncommutative 
formulas in [Nis91]. 

Proposition 6.1. Let ¥ be a field, X := {x\, . . . , x n } be a set of variables and ■< be some linear 
order on X. Then, for any natural numbers m < n and d < [n/m\, there exist polynomials 
fi-> ■ ■ ■ > fd from ¥[x\, . . . , x n ], such that every fi can be computed by an ordered formula of size 
0(m) and every ordered formula computing Y\i=i fi ^ as s ^ ze 2 n ^ . 

Proof. First, note that it is sufficient to prove the proposition for m = 2 and any d < \n/2\. 
(Because, assume that the proposition holds for m = 2 and any d < \n/2\. And let m',d' be 
such that m' < n and d' < [n/m'\. By assumption, for m = 2 and d' < [n/m'\ < [n/2\, there 
are fi, ■ ■ ■ , fd' from ¥[x\, . . . , x n ] that can be computed by ordered formulas of size constant 
[that is, 0(2), and hence of size 0(m')], and such that every ordered formula computing Yli=i fi 
has size 2 n W.) 

Thus, let m = 2 and d < \n/2\. Assume without loss of generality that the linear order -< is 
such that x\ < X2 ^ . . . ^ x n . Abbreviate the variables x±,...,Xd as yi,...,yd, respectively, 
and abbreviate the variables 2^+1, . . . , X2d as zi, . . . ,Zd, respectively (that is, the y^s and Zj's are 
just different notations for their corresponding variables, introduced to simplify the writing). 
We thus have y\ < . . . < yd ^ z\ < . . . ^ 

For every i = 1, . . . , d, define the following polynomial: 

fi ■= {Vi + Zi) . 

Define 

d d 

HARD d := Hf i = H(y i + z i ). 

i=l i=l 
16 



We show that every ordered formula of HARD^ (under -<) is of size at least 2^ d \ Note that 
HARD^ is a homogenous and multilinear polynomial of degree d. 

Recall that [[HARD^] is the noncommutative polynomial obtained from HARD^ by ordering 
the products in every monomial in accordance to the linear order ^. By definition of ordered 
formulas, it suffices to lower bound the size of noncommutative formulas computing [HARD^J. 
For this purpose we use a rank argument introduced in [Nis91]. Nisan defined the matrix Mk{f) 
associated with a noncommutative polynomial / as follows: 

Definition 6.1 ([Nis91]). Let f £ ¥{ xi, . . . , x n ) be a noncommutative homogenous polynomial of 
degree d. For every < k < d, we define Mfe(/) to be a matrix of dimension n k x n as follows: 
(i) there is a row corresponding to every degree k noncommutative monomial over the variables 
{xi, . . . ,x n }, and a column corresponding to every degree d — k noncommutative monomial over 
the variables {x±, . . . , x n }; (ii) for every degree k monomial ^# and every degree d — k monomial 
jV , the entry in Mk(f) on the row corresponding to ^ and column corresponding to JV is the 
coefficient of the degree d monomial ■ JV in f . 

Theorem 6.2 ([Nis91] Theorem 1). Let f be a degree r homogenous noncommutative polyno- 
mial. Then, every noncommutative formula computing f has size at least ^^, =0 rank (Mfc(/)) . 

In view of Theorem 6.2, it suffices to prove the following claim: 

Claim 6.3. For any < k < d we have rank(M fe ([HARD d ])) > (£) . 

Proof of claim: Consider the matrix MfcdHARD^]]). Let be the matrix obtained from 
A/fc([HARD^]) by removing all rows and columns excluding the following rows and columns: 

(1) the rows corresponding to degree k multilinear monomials containing only yi variables, 
such that the order of products in the monomial respects -< ; 

(2) the columns corresponding to degree d — k multilinear monomials containing only Z\ 
variables, such that the order of products in the monomial respects <. 

Consider a degree k monomial ^# = yi x ■ ■ ■ yi k , where i\ < . . . < if.. Let J = [d] \ . . . , if-}. 
We can denote the elements of J as {ji, . . . , jd-k}, where ji < ■ ■ ■ < jd-k- Observe that the 
monomial ^# has on its corresponding row in only zeros, except for a single 1 in the position 
(that is, column) corresponding to the degree d—k monomial j¥ = Zj x ■ ■ ■ Zj d _ k . (Indeed, note 
that the coefficient of the degree d monomial ■ jV in [HARD^J is 1.) 

Note that A& contains (^) rows corresponding to all possible degree k multilinear monomials 
^# in the y variables whose product order respect -<. Similarly, A^ contains (^) columns 
corresponding to all possible degree d — k multilinear monomials jY in the z variables whose 
product order respect <. By the previous paragraph: (i) each of the rows in A& has only one 
nonzero entry; and (ii) for every row, the nonzero entry is in a different column from those of 
other rows. We then conclude that A^ is a permutation matrix. Therefore: 



The claim follows since clearly rank(Afe) < rank(M^ ([HARD^]])) . Bciaim 

By the claim and by Theorem 6.2, we conclude that the ordered formula size of HARD^ is at 
least 





□ 
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6.1. A lower bound approach. Here we discuss a simple possible approach intended to 
establish lower bounds on OFPC proofs, roughly, by reducing OFPC lower bounds to PC 
degree lower bounds and using the bound in Section 6 (Proposition 6.1). 

Let Qi(x), . . . , Q m (x) be a collection of constant degree (independent of n) polynomials from 
¥[x±, . . . , x n ] with no common solutions in F, such that m is polynomial in n. Let fi(y), • • • , f n (y) 
be m homogenous polynomials of the same degree from ¥[yi, . . . , y^], such that the ordered 
formula size of each fi(y) (for some fixed linear order on the variables) is polynomial in n and 
such that the /i(y)'s do not have common variables (that is, each fi{y) is over disjoint set of 
variables from y). Suppose that for any distinct ii,...,id £ [n] the ordered formula size of 

UjfiM is 2Q(d) - 

Note: By the proof of Proposition 6.1, the conditions above are easy to achieve. Indeed, the 
fi(yi, z i) ,s defined in the proof of Proposition 6.1 have these properties: homogeneity, same 
degrees for all /j's and disjointness of variables, and an exponential increase in ordered formula 
size for products of the fi's. 

Consider the polynomials Qi(x), . . . , Q m {x) after applying the substitution: 

xi ^ My) . (7) 

In other words, consider 

Qi(fi(y), fn(y)), • • • , Q m (fi(y), f n (y)) . (8) 

Note that (8) is also unsatisfiable over F. We suggest to lower bound the OFPC refutation 
size of (8), based on the following simple idea: it is known that some families of unsatisfiable 
collections of polynomials require linear £l(n) degree PC refutations (where n is the number of 
variables). In other words, every refutation of these polynomials must contain some polynomial 
of linear degree. By definition, also every OFPC refutation of these polynomials must contain 
some polynomial of linear degree. 

Thus, assume that the initial polynomials Q = {Qi(x), . . . ,Q m (x)} in the xi,...,x n vari- 
ables, require linear degree refutations — in fact, an w(logn) degree lower bound would suffice. 
Thus, every PC refutation contains some polynomial h of degree w(logn). Then, we might 
expect that every PC refutation of its substitution instance (8) contains a polynomial g € F[y] 
which is a substitution instance (under the substitution (7)) of an a; (log n)-degree polynomial 
in the x variables. This, in turn, leads (under some conditions; see below for an example of 
such conditions) to a lower bound on OFPC refutations. An example of sufficient conditions 
for super-polynomial OFPC lower bounds, is as follows: every PC refutation of (8) contains a 
polynomial g so that one of g's homogenous components is a substitution instance (under the 
substitution (7)) of a degree w(logn) multilinear polynomial from F[xi, . . . ,x n ]. We formalize 
this argument: 

Example: conditional OFPC size lower bounds. (Assume the above notations and condi- 
tions.) If: every PC refutation of (8) that has polynomial in n number of proof-lines contains a 
polynomial g 6 F[yi, . . . , yg\ such that for some t < deg(g), the t-th homogenous component 
of g (that is, the sum of all monomials of total degree t in g) is a substitution instance (under 
the substitution (7)) of a deg ree ^(log/i) multilinear polynomial from F[xi, . . . 
Then: every OFPC refutation of (8) is of super-polynomial size (in n). 

Proof of example: It suffices to show that any ordered formula of g is of super-polynomial size in 
n. Note that breaking an algebraic formula into its corresponding homogenous components — 
according to the standard known procedure (cf. [Raz08], proof of Proposition 2.3) — is also 
applicable to ordered formulas: in other words, if g has a polynomial-size ordered formula then 
each of g's homogenous components has a polynomial-size ordered formula as well. 9 Thus, it 

^Assume we have an ordered formula $ and we want to construct the ordered formula $' fe - ) that computes the 
fc-th degree homogenous polynomial of (the polynomial computed by) We work by induction on the structure 
of the formula $: a plus gate in the original formula $ turns into a plus gate u with two children, such that if 
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suffices to show that every ordered formula of g^' is of size super-polynomial in n (because then 
g itself has super-polynomial size). 

By assumption, is a substitution instance of some degree w(logn) multilinear polynomial 
h € F[xi, . . . ,x n ]. Since is homogenous and all the /j(y)'s have the same degree and are 
homogenous, h must be homogenous too. Since h is multilinear we can write h = ^2j^jbjAij, 
where the Ai^s are multilinear monomials in the x variables and bj are coefficients from F. 
Now, consider some single monomial Ai from ^Zj^jbjAij. By multilinearity and homogeneity 
of h every other monomial Ai' ^ Ai in h must contain an X{ variable that does not appear in 
Ai. We can assign to such X{. Doing this for every monomial Ai' ^ Ai, we get that h (under 
this partial assignment to the x variables) is equal to bAi, for some coefficient b € F. In a 
similar manner, by disjointness of the variables in the /i(y)'s, there exists a partial assignment 
p : y — > {0}, such that gW f p i s just a substitution instance (under the substitution (7)) of a 
single degree w(logn) multilinear monomial in the x variables. This means that \ p is the 
product of w(logra) distinct fi(y)'s (multiplied by b). Therefore, by assumption on the /i(y)'s 
every ordered formula of is of size exponential in 2 w ( logn ), which is super-polynomial in n. 
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